Average sentence length |
---|
13.7711 |
sentence length | percentage |
---|---|
3 | 0.8067 |
4 | 2.7267 |
5 | 3.8767 |
6 | 5.4067 |
7 | 6.0067 |
8 | 6.4800 |
9 | 7.0033 |
10 | 7.0367 |
11 | 6.2300 |
12 | 6.3567 |
13 | 5.5600 |
14 | 4.9733 |
15 | 4.6567 |
16 | 3.9400 |
17 | 3.7667 |
18 | 3.3867 |
19 | 2.7400 |
20 | 2.6033 |
21 | 2.1033 |
22 | 1.8600 |
23 | 1.7700 |
24 | 1.4567 |
25 | 1.4167 |
26 | 1.1133 |
27 | 0.9967 |
28 | 0.9433 |
29 | 0.8100 |
30 | 0.7267 |
31 | 0.5233 |
32 | 0.5000 |
33 | 0.4333 |
34 | 0.3733 |
35 | 0.3367 |
36 | 0.2467 |
37 | 0.2900 |
38 | 0.1633 |
39 | 0.1367 |
40 | 0.1133 |
41 | 0.0667 |
42 | 0.0167 |
43 | 0.0133 |
44 | 0.0133 |
46 | 0.0067 |
47 | 0.0100 |
49 | 0.0033 |
Next we measure the length of a sentence by its number of words. Again, we are interested in average length and length distribution.
See above.
For simplicity, the number of words in a sentence is calculated as 1+(Number of blanks in the sentence).
Average length:
select avg(1+char_length(sentence)-char_length( replace(sentence," ",""))) from sentences;
Distribution:
SELECT @all:=count(*) from sentences;
select 1+char_length(sentence)-char_length( replace(sentence," ","")), 100*count(*)/@all from sentences where 50>=1+char_length(sentence)-char_length(replace(sentence," ",""))group by 1+char_length(sentence)-char_length( replace( sentence," ",""));
4.2.1 Length of sentences in characters